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Abstract 

Let A z be the Cantor space of bi-infinite sequences in a finite alphabet A, and 
let a be the shift map on A z . A cellular automaton is a continuous, ^-commuting 
self-map <3> of A z , and a ^-invariant subshift is a closed, (#, <r)-invariant subset 
S C A z . Suppose a E A 1, is S-admissible everywhere except for some small region 
we call a defect. It has been empirically observed that such defects persist under 
iteration of <3?, and often propagate like 'particles'. We characterize the motion of 
these particles, and show that it falls into several regimes, ranging from simple 
deterministic motion, to generalized random walks, to complex motion emulating 
Turing machines or pushdown automata. One consequence is that some questions 
about defect behaviour are formally undecidable. 
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A recurring theme in cellular automata is the emergence and persistence of 
homogeneous 'domains' (each characterized by a particular spatial pattern), 
separated by defects (analogous to 'domain boundaries' or 'kinks' in a crys- 
talline solid). Defects were first empirically observed by Grassberger in the 
'elementary' cellular automata or 'ECA' (radius-one CA on {0, 1} Z ) with num- 
bers #18, #122, #126, #146, and #182 [Gra84b,Gra84a] and also noted in 
ECA #184, which was originally studied as a simple model of surface growth 
[KS88b, §111. B], and later as a model of single-lane traffic [Fuk99,Bla03,BF05]. 
Based on Grassberger's observations, Lind [Lin84, §5] conjectured that the 
defects of ECA#18 perform random walks. This conjecture was reiterated by 
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Boccara et al, who empirically investigated the motion and interactions of 
defects in ECA #18 and also #54, #62, and #184 (see Figure 1), and longer 
range totalistic CA [BR91,BNR9lj; see also [IlaOl, §3.1.2.2 & §3.1.4.4]. 

Eloranta developed the first rigorous mathematical theory of cellular automa- 
ton defects in [Elo93a,Elo93b,Elo94,Elo95], and, together with Numelin, proved 
Lind's conjecture in [EN92]. Meanwhile, Crutchfield and Hanson developed 
an empirical methodology called Computational Mechanics [Han93], which 
they applied to ECA#18 [CH92,CH93a] and other CA contrived to act like 
ECA#18 [CH93b], as well as ECA#54 [CH97,CHS01] andECA#110 [CHS01]. 
They also obtained a tight theoretical bound on the number of possible reac- 
tions between two defects [CHS01] (improving an earlier result of [PST86]). 
Finally, using genetic algorithms, they and their collaborators 'bred' CA which 
performed computations such as synchronization or density-classification. A 
careful analysis then revealed that these CA performed their computations 
through propagating and interacting defects; this 'particle-based computation' 
had emerged spontaneously through natural selection [DMC94,CHM98,CHS01] 




(A) ECA #54 




(B) ECA #184 

Fig. 1. Spacetime diagrams showing defect dynamics in two cellular automata. Each 
picture show 120 timesteps on a 300 pixel array (time increases downwards). 
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In two companion papers [Piv07a,Piv07b], we develop algebraic invariants to 
explain why defects persist under iteration of the cellular automaton, instead 
of disappearing. These defects often behave like 'particles', which propagate 
through space until they collide and interact with other defects. In this paper, 
we characterize the motion of these 'defect particles', when the background 
domain is a one- dimensional subshift of finite type which is invariant under the 
action of a one-dimensional cellular automaton. In §1 we formally define 'defect 
particles' and introduce a framework to investigate their motion. Depending 
on the ($, cr)-dynamical properties of the ambient subshift, the defect particle 
falls into one of several 'kinematic regimes', ranging from ballistic motion (§2), 
to a generalized random walk (§3), to the emulation of Turing machines or 
pushdown automata (§4). Sections §2, §3 and §4 are logically independent of 
one another. 



Preliminaries & Notation 

For any L < R G Z, we define [L...R] := {L,L+1, ...,R}, [L...R) : = 
[L...R-1], (L...R] := [L+1...R], etc. We likewise define (-oo.. ..R], [L...00), 
etc. Let A be a finite alphabet, and let A z be the set of all doubly-infinite 
sequences in A, which we write as a = [a z ] z€ z, where a z G A for all z G Z. 
The Cantor metric on A z is defined by d(a, b) = 2~ A( - a ' h \ where A(a, b) : = 
min{|z| ; a z 7^ S z }. It follows that (A z ,d) is a Cantor space (i.e. a compact, 
totally disconnected, perfect metric space). If a G A z , and DcZ, then we de- 
fine au G A v by au := [oujueu- If z G Z, then strictly speaking, a 2+ u G A z+V ; 
however, it is sometimes convenient to 'abuse notation' and treat a z+ u as an 
element of A v in the obvious way. 

We define the shift map o : A z — >A Z by a(a) z = a z+x for all a G A z and z G Z. 
A cellular automaton is a transformation $ : A z — >A Z that is continuous and 
commutes with a. Equivalently, <3> is determined by a local rule <p : A^~ r '" r ^ — >A 
(for some r G N) such that $(a) 2 = (^(a^. ^j) for all a G A z and z G Z 
[Hed69]; we say that $ has radius r. 

A subset S C A z is a subshift [LM95,Kit98] if S is closed in the Cantor 
topology, and er(S) = S. For any U C Z, we define Su := {su ; s G S}. In 
particular, for any q > 0, let S g := S[ ... g ) be the set of admissible q-words 
for S. We say S is subshift of finite type (SFT) if there is some q > (the 
radius of S) such that S is entirely described by S 9 , in the sense that S = 
js G A z ; S[ z ,,, z+q ) G Sg, Vz G Zj. If q = 2, then S is called a Markov subshift, 
and the elements of S2 C A 2 are called admissible transitions; equivalently, S 
is the set of all bi-infinite directed paths in a digraph whose vertices are the 
elements of A, with an edge a ^ 6 iff (a, b) G S2. 

If $ : ^4. z — >-v4 z is a cellular automaton, then we say S is (weakly) ^-invariant 
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if $(S) C S (i.e. $ is an endomorphism of S). For example, the set Fix [$] := 
(a) = a | of &-fixed points is a ^-invariant SFT. Likewise, if p £ N 
and v £ Z, then the set Fix of ($, p)-periodic points and the set Fix [<3> p o cr~ pu ] 
of (<&,p,v)-travelling waves are ^-invariant SFTs. Also, for any p £ N, the set 
Fix [a p ] of p-periodic sequences is a ^-invariant SFT. 

If $ has radius r, then for any q > 0, $ induces a function $ : A q+2r — >A q . 
If S C A z is an SFT determined by a set S q C of admissible g-blocks, 
then ($(S) C S)) -<=>- (<&(S 9 +2r) Q S g ). The monoid of endomorphisms of 
an SFT can be quite huge; see [Kit98, Ch.3] or [LM95, §13.2]. 

If S C A z is a subshift, then we define S" := S ( _ 00 ..._i ] C A^°°-~ 1] to be 
the set of all left-infinite, S-admissible sequences, and define S + := S[i...co) C 
AP""°°} to be the set of all right-infinite, S-admissible sequences. 

Notation & Font conventions: Upper case calligraphic letters (A, B,C, . . .) de- 
note finite symbolic alphabets (of cellular automata, Turing machines, etc.). 
Upper-case boldface letters (A, B, C, . . .) denote subsets of A z (e.g. subshifts); 
lowercase bold-faced letters (a, b, c, . . .) denote elements of A z . Zapf letters 
(a, 6, c, . . .) are elements of A; Roman letters (a, b, c, . . .) are integers. Upper- 
case hollow font (U, V, W, . . .) are subsets of Z, upper-case Greek letters 
(<&, \P, . . .) denote functions on A z (e.g. CA), and lower-case Greek letters 
((f), if}, . . .) denote other functions (e.g. local rules, probability measures). 

We generally indicate related objects by related letters. For example, if C, 1Z C 
A are two subalphabets, then a subshift of £ z would be denoted by L, with 
typical element 1 := [4]*ez e L (where l z £ £), whereas a subshift of 1Z Z would 
be denoted by R, with typical element r := [r z ] ze i £ R (where r z ETZ). 



1 Defect Particles 

Let S C A z be a subshift of finite type, and let $ : A z — >A Z be a one- 
dimensional cellular automaton with $(S) C S. By passing to a 'higher block 
presentation', we can assume that $ is a nearest-neighbour CA and that S 
is a Markov subshift. To be precise, suppose $ has radius r and that S is 
determined by a set S q of admissible g-blocks. Let P := max{2r, q}, let B : = 
A p , and let A C B z be the P -block presentation of A z (see [LM95, Defn. 1.4.1] 
or [Kit98, Fig. 1.4.1]). That is, A is the the Markov subshift defined by the 
digraph with vertex set A p , with an edge [«i, . . . , a P ] ~> [6±, . . . , Bp] iff 6 P — 
a p+ i for all p £ [1...P) (this is sometimes called the de Bruijn digraph of A p ). 
Thus, $ is conjugate to an endomorphism $ : A — >A, which can be extended 
(in an arbitrary way) to a cellular automaton \l/ : B z — >B Z such that ^(A) C 
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A and * 



<3>. Let S be the image of S inside A; then S is a Markov subshift 



of Z , and \P(S) C S. Now replace S with S and $ with 

Let a G A z , then a has a single defect in the interval [i...k] C Z if [oj, aj+i] G 
S2 for all j G" while Oj+i] G" S2 for all j G If z = k, then the 

defect has width 0, and consists of a single inadmissible transition between two 
half-infinite, S-admissible sequences: 

„0 rOOOOOOOT /i\ 

a - [ • • • a i-Zi a i-2i a i-l a i <H+V> a i+2i <H±3 ■ ■ ■ } l-U 

" ' it * ' 



admissible defect admissible 



(here we underline the admissible sequences for visibility). If % < k, then we 
say that . . . , «&) is a defect word of width w := k — v. 

_0 _ r 0000 l (n\ 

a - [■ ■ ■ fl i-3> a i-2> a i-l a i a i+1 ...a k a k+1 , a k+2 , a k+3 . . .j [Z) 



We want to rewrite this defect word as (a 2o _£ , . . . , a ZQ , . . . , a Zo+ ^), where z is 
roughly in the center of the defect. So let L := \w/2]—l and Ro := [w/2\ , so 
that w — L + Ro + 1. If 2 := i + L + 1, then z — Lq = i + 1 and zo + Ro = k, 
as desired. Define d z := a z ' +z for all z G [—Lq...Rq], and rewrite eqn.(2) as: 

_0 _ f rO rO rO 1 /o\ 

d — [ ■■■■> a z -L -2> a z -L -l *-io — *0 ••• d fl a zo+flp+l' a z +fio+2 ■ ■ - J \ 6 ) 

[If = 0, then L = —1, -Ro = 0, and Zq — i + 1, so eqn.(3) is equivalent to 
the zero-width defect in eqn.(l).] For all teN, let a* := <J>*(a). We say the 
defect is ^-persistent if a* has a defect for all t e N. In this case, 

a * = [ ■■■■> 4 t -L t -2i 4 t -L t -l d -Lf- d 0--- d Rt a z t +R t +n fl * t +fl t +2 ; ; J ( 4 ) 

for some z t G Z, i? t G N and L 4 G i?t — 1}. The next lemma bounds the 
growth-rate and displacement of the defect during one ^-iteration. 

Lemma 1.1 Lett EN. Then: 

(a) zt - L t - 1 < z t+ i - L t+1 and z t+l + R t+l < z t + R t + 1. 

(b) zt-Lt-2 < zt+i < Zt + Rt + 1. 

Proof: (a) For simplicity, set t := 1. The boundaries of the defect word can 
advance by at most one unit during each timestep, because $ is a nearest 
neighbour CA and $(S) C S. In other words, zq — Lq — 1 < z\ — L\ and 
also z\ + R\ < zo + Ro + 1- (b) follows because Z\ — L\ — 1 < z\ < z\ + R\, 
because L\ > — 1 and < R\. □ 



The width w t ~ 2L t of the defect word may fluctuate with time. We say that 
the defect is a particle if L := max{L t } and R := max{i? t } are finite (possibly 
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L = —1 and R = 0). Otherwise the defect is called a blight (i.e. its size grows 
without bound over time). We will restrict our attention to particles. It will be 
convenient to treat the particle as having constant width. Hence, we rewrite 
eqn.(4) as 

_t _ [ t t rt rt rt t t 

d — L- • ■ a z t -L-2 a z t ~L~l a--L--- a 0"- a R a z t +R+l a z t +R+2 

[ft ft ft J~ t j~ t t ( 

h. h d_ L ...d ...d R t x r 2 

That is: we pad the left side (resp. right side) of the defect with L — L t 
(resp. R — Rt) of the 'admissible' symbols, if necessary, and then we define 
[ n := a z t -L-n and r n := a l t +R+n f° r all n G N (note that, for convenience, we 
reverse the sign of index n in We say W := R + L + 1 is the width of the 
defect particle. (If all the defects had zero width, then L = — 1 and R = 
and W = 0, so the non-underlined block is empty.) We can now represent the 
defect particle finite automaton. 

A finite automaton is a quintuple (X, J), 0\ T, Q), where X is a finite input 
alphabet, V is a finite state domain, O is a finite output alphabet, T : XxV — >V 
is an update rule, and Q:Ix D — >0 is an output rule. Finite automata are 
models of simple computers: starting from initial state description d 6 T>, and 
fed an input stream (; , i\, x 2 , . . .) G X N , the automaton undergoes a series of 
state transitions d d\ ~> d 2 ~> . . . [where := T(i t , ^)] and produces an 
output stream (oi, 02, 03, . . .) G O n , where o t +i := 0(4, d t ). See [HU79, §2.2]. 

The defect particle in eqn.(5) behaves like a finite automaton with X : = 
a l+2 x v ._ j\l- L " R \ and C := [-L-2 . . . R+l}. The automa- 

ton's inputs are 1* := [Cl+z, • • • , tf\ and r* := . . . , r R+ i], its internal state 
is d* := [d^ L , . . . , cCq, . . . , dQ G T>, and its output is a 'velocity vector' in 
V := [— L — 2 . . . R+l}. That is, there is a unique update rule T : A L+2 xVx 
jR+i — and ve i 0Clt y f unc tion V : A L+2 x V x A R+1 — >V such that 

d m = T(l*,d*,r*) and z t+1 = z t + V(l\ d\ r*) (6) 

Let L and R be the unique (a, $)-transitive components of S such that 
[. . . C^C^i] i s L-admissible and [rfr^ . . .] is R-admissible for all t > (pos- 
sibly L = R). We say that a* has an (L, R) defect particle of width W. 

Example 1.2: (a) (ECA#184) Let A = {0, 1}. Let £ $ 184 : A z — >A Z be 
elementary cellular automaton #184. (Recall: the number '184' encodes the 
local rule <p : A^ 1 ' ' 1 ^ — >A via the formula Yh=o S)=o Y<k=o 0(*> 3i k){^i + 
2j + k) = 184). Let G* C A z be the Markov subshift given by digraph 
■ ^ □ (we use the convention that ■ = and □ = 1, to ease comparison 
between equations and figures). Thus G* := {(re:) 00 , (db) 00 }, where (re:) 00 : = 
[. . . . .], etc. (the zeroth coordinate is underlined). Then | G = 

as shown in Figure 2(*). There are two (G*, G*)-defects of width 0, shown in 
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Fig. 2. (*) The periodic background generated by £ &i&4 acting on G* in Example 
1.2(a). (a^, (3, u>^, J^)- Defect particles of £ <l>i 84 acting on G. (Note that the 
image labelled (/3) actually depicts the coalescence of an lo + and an oT into a (3.) 
See also [Piv07b, Examples 2.2(a) & 3.5(c)] or [BNR91, §III(A)]. 

Figure 2(7 ± ). The 7 + defect consists of a single inadmissible transition (a, ■), 
while 7~ consists of an inadmissible (□, □) transition. Now, W = 0, so T> = 
and T is trivial, and V : A x A — >{-l,0, 1}. We have V(0, 0) = 1 (for 7 +) 
and V{1,1) = -1 (for 7-). 

Let G01 C A z be the Markov subshift determined by digraph ■ □. Thus, 
G01 = G U G l9 where G := {m 00 } and G x := {n 00 }. Note that G i C 
Fix [ e <& 184 ] ; hence G01 is a ^-invariant subshift. The (Go, Gi)-defect of width 
is a particle shown in Figure 2(/3). (The (Gi, G )-defect of width zero is 
unstable, and immediately 'decays' into a two other particles). Again, W = 0, 
so V = 0, T is trivial, and V : A x .A — >{-l, 0, 1}. We have V(0, 1) = 0; i.e. 
the j3 particle is stationary. 

Let G C A z be the subshift of finite type determined by the admissi- 
ble 3-tuples G 3 := {(■■■),(□□□),(□■□),(■□■)}. We block-recode this as a 

Markov subshift in the alphabet A 3 , given by digraph (■■■); (□■□) 

(■□■); (□□□). Thus, G = G*UGoi, where G* and G i are as above. Thus G 
has three ( £ $ 184 , <r)-transitive components: G*, Go and Gi. Figure 2 shows sev- 
eral defect particles of £ <3> 184 acting on G. The defects a ± and w ± have width 1, 
so D = A 3 . Although they had width as defect particles in G* or G01, the de- 
fects 7 ± and (3 have width 2 as defect particles in G, so V = A 3 x A 3 (although, 
by the definition of block- recoding, we could replace this with T> = A 4 ). For 
all seven particles, the defect word d e T> is constant over time. The values 
of L, R, d, and the (constant) value of V : A x V x A — >{— 1, 0, 1} for each 
defect are given by Table 1. See also Example 2.3(a). 

(b) (ECA#54) Let A = {0, 1}. Let £ $ 154 : A z — >A Z be elementary cellular 
automaton #54. Let B := B U Bi, where B is the four-element cx-orbit of 
0010 and B\ is the four-element a-orbit of 1101. Then £ <3> 54 (Bo) = B\ and 
e $5 4 (Bi) = B , so B is ( e $ 54 , a)-transitive, so all defects have L = B = R. 
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Table 1 

Seven defect particles for ECA#184 acting on G; see Example 1.2(a). 




Fig. 3. (*) The periodic background generated by £ $ 54 acting on B in Example 
1.2(b). (a, (3, 7 1 * 1 ): Four defect particles of £ $ 54 acting on B. See also [Piv07b, 
Example 3.5(b)], [CH97, Fig.8], or [BNR91, §III(C)]. 



Also, e$54 1 B = cr 2 [see Figure 3(*)]. We recode B as a topological Markov 
subshift in the alphabet A A , with admissible 4-words B := {□□□■, 

□■■■}. Figure 4 shows the £ $ 54 -evolution 
of several defect particles in B, along with the relevant values of z, R, L, V, 
V, and T. See also Example 2.3(b). 

(c) (ECA#110) Let A = {0, 1}. Let E C A z be the 14-element a-orbit 
of the 14-periodic sequence (■■■□■■□□■□□□□□) 00 . If £ <I> 110 is ECA#110, then 
e $uo I „ = o -4 [see Figure 5(*)1, so E is ( e $ uo , cr)-transitive, so all defects have 
L = E = R. Figure 6 shows the e $ 110 -evolution of two defects (called 'A' and 
'5' in the nomenclature of [Coo04]) along with the relevant values of z, R, L, 
V, and V. 

Remark 1.3: (a) V takes values in V := [— L — 2 . . . by Lemma 1.1(b). 

However, the average value of V over time must be in [—1,1], because the left 
endpoint of the defect has a minimum velocity of —1, while the right endpoint 
has a maximum velocity of +1 [by Lemma 1.1(a)]. If V < 1 (resp. V > 1), this 
means that the right (resp. left) endpoint is moving leftward (resp. rightward) 
at speed greater than 1, which means the defect particle is shrinking, which 
is only sustainable for a short period of time. For example, the particle can 

— # 

achieve an instantaneous velocity V = R+l only by shrinking from a defect of 
width W to one of width 0; it must later remain at velocity V = for (R+ 1) 
iterations to grow back to width W. 
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Fig. 4. Defects in ECA#54. We treat the symbol a z as 'defective' if the word 
(a z _2, a z , a z +i) is not B-admissible. The admissible segments of each sequence 
are boxed; hence the unboxed segments are the defect words. The table on the right 
describes the values for z, R, L, and V and the definition of T> in each well 
as the relevant values of the update rule T : AxT> x A — >T>. See Example 1.2(b). 
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Fig. 5. (*) A 30 x 30 image of the periodic spacetime diagram of £ f 110 acting on E 
from Example 1.2(c); (A,B) 60x60 images of the e $ 110 -evolution of two defect par- 
ticles in E; See [Lin86,McI99a,McI99b,CHS01], [IlaOl, §3.1.4.4], [W6102, Chap. 11], 
[Piv07b, Example 3.5(d)], and especially [Coo04]. 

The 'constant width' convention of eqn.(5) masks this shrinkage by 'padding' 
the defect word d i+1 with up to R+l admissible characters from af^ 1 1 Zt+1 +R+2] i 
this is why the function T needs r* as input. Likewise, possibly rapid leftward 
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Fig. 6. The A and B defect particles of ECA^llO. We treat the symbol a z as 'de- 
fective' if the word (a z -Q, . . . , a z , . . . , az+i) is not E-admissible. The admissible 
segments of each sequence are boxed; hence the unboxed segments are the defect 
words. The table on the right describes the values for z, R, L, and V in each case. 
The arrow path is the sequence (zt)te"- The left-hand and right-hand polygonal 
paths are the sequences (L t )ten and (Rt)teN- See Example 1.2(c). 

motion requires T to incorporate 1* as input. In most examples, however, the 
particle moves slowly, and we can reduce the number of boundary inputs. 

(b) If W = 0, then T> = and T is trivial, while V is a function V : 
Ax A — >{-l,0,l}. 

(c) If W > 1, then by passing to the Wth higher power representation 
[LM95, Defn.1.4.4], we can assume that L = and R = 1, so that W = 2. To 
see this, replace A with A := A w , and represent a = [. . . a—\ a a\ a 2 . . .] G A z 
by 



a-2W a -W 



"0 



a W 



12W 



a-w-l 



l-l aw -I a 2W~l a 3W-l 



e A 1 



(Note: this is not the same as the higher 6/ocfcrecoding described earlier). Thus, 
if a* is as in eqn.(5), and z t := [z t /W\ , then a* = [. . . fa k d d\ ?i? 2 . . .}, where 
C n ,? n G Sw for all n G N, while d and d\ are in A. The original defect word 
d* is split between do and d\. The particle's behaviour now depends only on 
its nearest neighbours, and its speed is never greater than 1. In other words, 
we have T : A x V x A — >V andyJxDx A — ►{-!, 0, 1} in eqn.(6). The 
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Table 2 

Kinematic regimes for one-dimensional defect particles. 



price of this manoeuvre is that each defect word deD can be represented in 
W distinct ways as a pair d ), depending upon the value of z t mod W; 
this may translate into W spuriously distinct 'particle types' [see Definition 
2.2 and Remark 3.5(a) below]. Also, it may make some dislocations look like 
interfaces [see Remark (d) below]. Nevertheless, it will be useful in the proofs 
of Proposition 4.1, Theorem 3.3 and Corollary 3.6. 

(d) Two sequences b, c G A 1, are homoclinic if there is some N > such 
that b z = c z for all z G Z with \z\ > N. The defect in a* is called removable 
if a* is homoclinic to some s G S; see [Piv07b, §1]. Otherwise the defect in 
a* is essential — i.e. it is impossible to remove the defect by changing a* in 
some finite set. If L ^ R [e.g. Figure 2(/3)] then an (L,R)-defect is called 
an interface, and is necessarily an essential defect; see [Piv07b, §2]. If L = R 
[e.g. Figure 2(7 ± )], then an (L,R)-defect is called a dislocation, and may or 
may not be essential, depending upon whether it induces a 'phase slip' in the 
periodic structure of L; see [Piv07b, Example 3.1]. <0 

The kinematics of a one-dimensional defect particle falls into several regimes 
summarized in Table 2, depending on the (a, <3>)-dynamical complexity 
of L and R. In the Ballistic regime (see §2), the defect acts as a fi- 
nite automaton driven by periodic input, and moves with a constant av- 
erage velocity through a periodic background. ECAs #54, #62, #110, 
and #184 are all in this regime, which has been studied empirically 
in [Gra84b,Gra84a,BR91,BNR91,Han93,CH97,CHM98]. At the opposite ex- 
treme, in the Diffusive regime (see §3), the defect acts like a finite-state Markov 
process, and performs a generalized random walk. Diffusive defect dynamics 
has previously been analyzed by Eloranta [EN92,Elo93a,Elo93b,Elo94]. In the 
Turing regime (see §4), the defect moves through an inert, positive-entropy 
background, and modifies this background with its passing; the system acts 
like a Turing machine, where the particle is the 'head' and the inert back- 
ground is the 'tape'. In the Autonomous Pushdown Automaton regime (see 



11 



§4), the defect has a $-fixed, positive cr-entropy domain on one side (which 
we treat as a 'stack' memory), and a zero-entropy domain on the other side; 
the system acts like a pushdown automaton operating autonomously (i.e. with- 
out external input). In the Markov Pushdown Automaton regime (see §4), the 
defect has a <3>-fixed, positive a-entropy domain on one side (which we treat 
as a 'stack'), and a $-resolving subshift on the other; the system acts like a 
pushdown automaton driven by noise from a Markov process. The Compli- 
cated regime is none of the above, and is probably too diverse to make any 
useful generalizations. 



2 The Ballistic Regime 

Let $ : A z — >A Z be a cellular automaton, and let X C A z be a ^-invariant 
Markov shift. Let L, R C X be ($, a) -transitive subshifts of X, let W e N, and 
let Dlr, be the set of all sequences in A z with a single (L, R) defect particle 
of width W, such as shown in eqn.(5). By hypothesis, $(D^ R ) C D}^. 

Theorem 2.1 Suppose L and R are a-periodic and (er , $) -transitive. Then 
the dynamical system (D}^, $) is isomorphic to a dynamical system (X x 
Z, E), where X is a finite set, and where E : X x Z — >X x Z is defined by 
H(^, z) : = (£(0, z + f or some functions £ : X — >X and V : X — >V : = 

[-L-2...R+1]. 

Proof idea: The defect particle's internal state is a finite automaton driven by 
a periodic input (because L and R are periodic); thus, by incorporating the 
phase of this periodic input into the state description of the defect, we can treat 
it as an autonomous finite automaton (i.e. a finite-state dynamical system) 
(X, £). The defect's position is then obtained by integrating the velocity signal 
generated by (<-f,£). 

Proof: Any u-periodic sequence is automatically ^-periodic. Thus, by hypoth- 
esis, every element of L is <r PL -fixed and $ < ^ i -fixed for some Pl, Ql £ N. 
But L is ($, <r)-transitive, so this means that L consists of a single finite 
(er, $)-orbit containing exactly PlQl elements. Recall that L~ C A^ 00 "^ 1 ^ 
is the set of all left-infinite L-admissible sequences. Since L is a cx-periodic 
Markov shift, any element [. . . , [3, C2, in L~ is completely determined by the 
entry ( 1 . Hence there is a subset C C A with #(£) = PlQl, and a bijection 
A : C — >L~ where, for any ( G £, A(/~) is the unique sequence [. . . , C 3 , ( 2 , fi] in 
L~ with Ci — C . Furthermore, there are bijections <Pl : C — >C and ^ : C — >£ 
such that $ o A = A o ip L and a o A = A o q L . 

Likewise, R is <r Pfl -fixed and $^ fl -fixed (for some Pr,Qr 6 N), so R has 
exactly PrQr elements. Recall that R + C A^- 1 '" 00 ^ is the set of right-infinite 
R-admissible sequences. There is a subset 1Z C A with #(H) = PrQr, and 
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Table 3 

Ballistic defects in ECA#184; See Example 2.3(a) 

a bijection p : 72. — >R + so that, for any r G 72, p(r) is the unique sequence 
[n, r 2, ^3, • • •] in R + with with ri = r. There are bijections tfR : 72 — *-72- and 
<,r : 72 — >72 such that $op = poy? R and cr o p = p o q R . 

Thus, the sequence a* in eqn.(5) is entirely determined by the data 
(f/, d*, r{\ z t ) G C x V x 72 x Z, because [. . . ^, f 2 *, f/] = A(^), and p(r{) = 
[rf, r|, r |, . . .]. Define * : £ x V x 72 x Z — >D^ R by , d, r ; 2) := [ldr], 
where 1 := A(f) G L~, r := p(r) G R + , and we place d so that its center 
coordinate is at z. Then \1/ is a bijection. 

Let X := £ x V x TZ. IfT and V are as in eqn.(6), then we can restrict 
them to functions T| : X — >T> and v := V\ : X — >Y. Define £ : X — >X by 
£(f,d, r) := (c',d',r r ), where 

f' := ^cxp^f), d' := T(f,d, r), and r' := Sr°<Pr(t), where v := V^f, d, r). 

Now define H : x Z — x Z as in the theorem statement. Then $ o \f = 
f o». □ 



Definition 2.2: The (finite) dynamical system (X, £) decomposes into a finite 
disjoint union of finite ^-orbits, called particle types, li V G X is a, particle 

type, then P := #(V) is the period of type T 7 , and V^T 7 ) := — ^ V(p) is the 
average velocity of type V . <0 

Example 2.3: (a) (ECA#184) We continue Example 1.2(a). There are three 
(<r, e $ lg4 )-transitive components in G, so there are three possible choices for R; 
for each one, we list the corresponding values of 72, Pr, Qr, p, $r, and <pr in 
Table 3(A) (the values for £, P L , Q L , A, and ip L would be exactly the same). 
In Example 1.2(a), we introduced seven defect particles for (G,<3>): four of 
width 1, and three of width 2. In all seven cases, we have T(f , d, r) = d. Thus, 
(:£xDx 72 — >C x V x 72 is given by f (f, d, r ) = o v?l(0» d > ^ ^( r )); 
where v := T^(f , d, r) G {—1, 0, 1}. The value of V is constant for each particle 
type, and was shown in the bottom row of Table 1. In all cases, we end up 
with £ = Id, so all particle types have period 1. Hence, the average velocity 
of each type is just the value of V on the (unique) member of that type. 

(b) (ECA#54) We continue Example 1.2(b). In this case, L = R = B, and 
£ = 72 = B := {□□□■, □■■■}. 
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The maps (fL = <Pr and q, = are defined 



Consider the 7 defects in Figure 4. In this case, T> — A — {0, 1}, so X = 
B x A x B. The 7 ± particle types correspond to 2-periodic orbit classes T + 
and T~ of the dynamical system £ : X — >X, where 

r + := {(□□□■, □, ■■■□) , (□■■■, □, □□■□)} 
and r := {(□■□□, □, ) , (□■■■, □, )} . 

Note that d = d is always □. Also, V = +1 on T + , so that £(c,n, r) = 
o <f(c), □, q o (p(r)) for both (/", □, r) G Y + . Likewise, V = — 1 on r~, so that 
£(f , □, r) = (q" 1 o □, o ^( r )) for both (f , n, r) eT". 

Remark: Theorem 2.1 can easily be generalized to defect particles in Z D , where 
X C A z is a transitive, cx-periodic subshift of finite type. However, if D > 2, 
then such particles cannot be essential defects [see Remark 1.3(d)], because if 
X is cx-periodic and D > 2, then any finite defect in X is removable. Defect 
particles may still be ^-persistent, however. The most familiar examples of 
removable, yet persistent, ballistic defect particles are the 'gliders' and 'os- 
cillators' of Conway's Game of Life [Epp02,Got03,BCG04] and its variants 
[Bay87,Bay88,Bay90,Bay91,Bay92,Bay94,Eva96,Eva01,Eva03a,Eva03b,Eva05]. 




3 The Diffusive Regime 



Under certain conditions, a defect particle performs a generalized random 
walk. To demonstrate this, we first review some elementary probability theory. 

Bernoulli Measures and (hidden) Markov Measures: Let A be a discrete set 
(finite or countable), and let A4(A N ) be the set of Borel probability mea- 
sures on ^4 N . If fx G M.(A N ), then fx is a-invariant if a (fx) = fx, where cr(fx) G 
M(A N ) is defined by a(fx)[B] := fx[a~ 1 (B)] for any Borel subset B C A N . 
The measure-preserving dynamical system (*4. N , fx, a) is then called a station- 
ary stochastic process. For any m, n G N and any c G A°" m \ let [c] n : = 
I a G A N ; 2L[ n ...n+m] = c} be the cylinder set defined by c at position n. Clearly, 
fx is cr-invariant iff /i([c] n ) = /x([c]o) for all m, n G N and c G A°" m ^. Thus, 
we write "/x[c]" to mean /x([c]o). We call fx a Bernoulli measure if there is a 
measure fx G M. (A) (the 'one-point marginal' of fx) such that, for any n G N 
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for any c G A [0 - n] , /i[c] = //> (co) pP (ci) ■ • -/i°(c n ). For example, if #(A) = A, 
then the uniform measure is the Bernoulli measure r\ with 77° (a) = 4 for all 
a e A; hence 77(c) = for all c G ^l - 71 !. 

A measure p G -M(*4. N ) is a Markov measure if there is a measure /x° G 
.M(.A) and a transition probability function r : A — >M.{A) such that, for any 
c G A^~ m \ /i([c] ) = /i ( co )r( C o, C i)r(c 1 , C2 )---r(c n _ 1 ,c n ); see [Kit98, §6.2], 
[LM95, §2.3] or [VarOl, §4.4]. In this case, p is er-invariant iff p° is stationary, 
meaning that p°(b) = I] a e^A t °( fl ) " T ( a ,b) for all b G A. For example, any 
Bernoulli measure is a cx-invariant Markov measure, with r(a, 5) = p°(b) for 
all a, b G A If £> is another set, and if) : A — >B is any function, we define 
. a n — > b n by ^N( fli) fl2? _ _ . = (^,( ttl ) j ^,( fl2 ) j . . .). We say v G .M(i3 N ) is a 

hidden Markov measure \i v = tp N (p), for some Markov measure p G 7W(^4 N ) 
and function ip : „4 — Bernoulli/Markov measures on A z are defined anal- 
ogously. 

Random Walks: Let V C Z, and let z/ G .M(V N ) be a hidden Markov measure. 
Define £ : V N ^Z N by S(v 1; v 2 , v 3 , . . .) := (0, v 1; Vl +v 2 , Vl +v 2 +v 3 , . . .). The 
probability measure uj := S(i^) G .M[Z N ] is called a (generalized) random walk, 
with increment process v. For example, the one-dimensional Simple Random 
Walk (SRW) is obtained by setting V := { — 1, 1} and letting v be the Bernoulli 
measure with ^[±1] = |; see [VarOl, Example 4.1]. 

Resolving subshifts: Let B C A, and let ScB z C A z be a Markov subshift. 
For any 6 E B, let Vs(b) ■= {« G £> ; (a, 6) G S2} be the predecessor set of 6, 
and let Fs(b) := {c G £> ; (£, c) G S 2 } be the follower set of 6. We say that S 
is left-regular if there is some constant P§ £ N such that ^ ["Pg(6)] = P§ for all 
b G £>. Likewise S is right-regular if there is some constant £ N such that 
#[^s(0]=^sfor all b E B. 

The Parry measure r\ G .M(S) is the measure of maximal cr-entropy on S, and 
is a Markov measure on S which assigns roughly equal probability to all S- 
admissible paths of any given length; see [Par64, Thm.10], [LM95, §13.3], or 
[Kit98, Thm.6.2.20]. If S is left- or right-regular, then rf is the uniform measure 
on B. If S is right-regular, then r(£,«) is the uniform measure on Ti^b) for 
every b G B; that is, r(b,c) = 1/Fs for all c G J-s{b). Likewise, if S is left- 
regular, and we define the 'backwards' transition probability r: A — >Ai(A) 
by r (a, b) := ri[ab}/r] (b), then r (•, b) is the uniform measure on Vs(b) for 
every b G B; that is f (a, b) = 1/P S for all a G V s (b). 

Let $ : A z — >A Z be a CA with $(S) C S. Suppose $ has local rule 4> : 
^{-1,0,1} — ^ Then S is a left-resolving subshift for $ if, for any fixed (6, c, d) G 
S 3 , with e := cj)(b, c, <f), the function Vs(b) 3 a 1— > 0(a, 6, c) G Ps( e ) is injective 
[LM95, Defn.8.1.7]. If S is left-regular, then 'injective' implies 'bijective'. In 
this case, for any (b, c) G S 2 , define 0(S, b, c) := {4>(a, S, c) ; « G Ps^)}; then 
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# [0(S, 6, c)] = Ps- Likewise, S is a right-resolving subshift for $ if, for any fixed 
(a, 6, c) G S 3 with e := 0(a, 6, c), the function !Fs(c) 3 d 1— > <f)(6, c, d) G J-"s( e ) 
is injective. If S is right-regular, then 'injective' implies 'bijective'. In this 
case, for any (6,c) G S 2 , define 0(6, c,S) := {0(6,c,<f) ; <f G JF(c)}; then 

# [0(6, c, S)] = -Fs- If S is either left- or right-resolving for $, then $(S) = S, 
which implies that the Parry measure on S is ^-invariant [Kit98, Thm.6.2.21]. 

Example 3.1: (a) Let B C A and let S := B z . Then B z is left- and right- 
regular, because Vs(t) = B = F s (6) for all b G B. If $ : — >A Z and 
$(i3 z ) C B z , then £> z is left-resolving for $ iff $ is left- perm utative on B, 
i.e. for any (6,c) G i3 2 , the function B 3 a 1— > 0(a,6,c) & B is bijective 
[Hed69]. Likewise, £> z is right-resolving iff $ is right-permutative on £>, i.e. for 
any (a, 6) G £> 2 , the function B 3 c 1— > 0(a, 6,c) £ B is bijective. The Parry 
measure is the uniform measure on £> z , and is preserved by any permuta- 
tive cellular automaton. In the terminology of [Elo93a,Elo93b], B is called a 
permutive subalphabet for $. 

(b) For example, let (£>, +) be a finite abelian group and let S := B z . Then 
$ is a linear cellular automaton on B z if there are endomorphisms tf-i, y^o? ipi G 
End (jB) such that, for all 6-1, 6 , 61 G B, we have 0(£_i, ^o; ^i) = V-i(^-i) + 
Vo(^o) + in this case, $ is left- (resp. right-) permutative on B iff 0_i 
(resp. 0i ) is an automorphism of B. (Note that we do not require that A be 
a group, or that $ be linear on the rest of A z .) Under pointwise addition, B z 
is a compact abelian group, and the Parry measure (the uniform measure) is 
the Haar measure on B z . 

(c) In particular, if B = Z/ n for some n G N, then $ is linear if there are 
constants <p-i,ipo,(pi G Z such that 0(£-i, 60, b\) = (<£>_i£_i + y?o£o + <Pih) 
mod n. In this case, <3> is left- (resp. right-) permutative on B iff 0_i (resp. 0i) 
is relatively prime to n. The Haar measure rj on i3 z is the 'natural' invariant 
measure for such linear CA. For example, if \x G M.(B Z ) is any measure sat- 
isfying broad conditions (e.g. an iV-step Markov measure with full support), 

1 N 

then $ asymptotically randomizes p, meaning that ^lim — ^ & n {p) = V in 

n=l 

the weak* topology on M(B Z ); see [Lin84,MM98,MM99,PY02,PY04,PY06]. 
Furthermore, if n is prime, then 77 is the only ^-invariant, a-ergodic measure 
with positive entropy [HMM03, Thm. 12]; for some generalizations and related 
results, see also [Piv05,Sab07]. 

(d) If (B, +) is a finite group, then a Markov subgroup is a Markov sub- 
shift S C B z which is also a subgroup of (B z , +); see [Kit87,KS88a,KS89] and 
[Kit98, §6.3]. It follows that ^ s (0) and V s (0) are subgroups of B (see [Kit87, 
Prop.3(ii)] or [Kit98, Lem.6.3.4(ii,iii)]). Furthermore, S is left- and right- reg- 
ular, because for any 6 G B, J r s{b) is a coset of J^s(O), and Vs((>) is a coset of 
Vs(0) (see [Kit87, Prop.3(iii)] or [Kit98, Lem.6.3.4(iv)]). The Parry measure 
of S is then the Haar measure on S as a compact group. 

If $ is a linear CA with local rule 0(6-i, &o, 61) = <f-i6-i + </?o^o + fih f° r 
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some constants <p_i,<pO) < Pi G Z [as in Example (c)] then $(S) C S. Also, $ 
is left- (resp. right-) permutative on B iff 0„i (resp. 0i) acts injectively on 
Ps(0) (resp. JF S (0)). As in Example (c), many measures on S (e.g. Markov 
measures with full support) are asymptotically randomized to 77 by such CA; 
see [MMPY06a,MMPY06b] . 

(e) Let B = {6} and let S = B z = {b}, where b = [. . . 666 . . .]. Then S is 
trivially left- and right-regular, because ^s(^) = {^} = Vs{b). If $(b) = b, 
then S is left- and right- resolving for $. The Parry measure of S is the point 
mass on b. (} 

Let $ : A z — >A Z be a cellular automaton. A resolving system for $ is a 
quadruple (L, R; A, p), where: 

(1) L, R C A z are Markov subshifts, and L U R is also a Markov subshift. 

(2) L is left-regular, $(L) C L, and L is left-resolving for $. 

(3) R is right-regular, $(R) C R, and R is right-resolving for $. 

(4) A G M.{L) is the Parry measure on L, and p G A4(R) is the Parry 
measure on R. 

Example 3.2: If £,K C .4, then A := £ z U ft z is a subshift of finite type 
iff either C = 1Z or they are disjoint. In this case, (C z , TZ Z ; A, p) is a resolving 
system for $ iff: [i] A (resp. p) is the uniform measure on C z (resp. TZ Z ); [ii] 
$(£ z ) = £ z and $ is left-permutative on £; and [iii] $(7?. z ) = TZ Z and $ is 
right-permutative on TZ, as in Example 3.1(a). (} 

Let (L, R; A, p) be a resolving system, let A := L U R, and let be the 

set of all elements in A z with a single (L, R)-defect of width W at zero. Let 
T> := A w . If 5 G Ai(T>), then we regard A (8> 5 <E> p as a probability measure 
on in the obvious way. Define ( : Dj^ R — >Z N by C(a) := (^ 0) Zi, z%, . . .), 

where, for all t 6 N, z t = Zt(a) G Z is as in eqn.(5). In other words, £(a) 
tracks the trajectory of the defect particle over time. If n G AI(Dlr), then 
C{p) is a probability measure on Z N . The main result of this section is: 

Theorem 3.3 Let $ : A z — >A Z be a CA, and let (L, R;A,p) be a resolving 
system for <3>. Let W G N, let 5 be any probability measure on T> := A , and 
let ji := A ® 5 ® p G -M(D^). Then uj := C(p) G A^(Z N ) zs a random walk. 



Example 3.4: Let A := Z /2 x{o, •}. Define <p : A { - lfi > 1} — >Abj <p ( 



6-1 60 61 




" ) , where 
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(A) Scale: 50 X 50 (B) Scale: 300 X 6000 (space X time) 

Fig. 7. The randomly walking defect particle of Example 3.4. 



a_l + uq + a\ if 6-1 = 6q = b\ = o; 

a_i + AO if f>—\ = 6q = o and 6\ = •; 
ao + «i if 6-1 = • and 6q = 6\ = o; 
1 - oo if £o = •■ 

• if £_i = • and a—\ = ao = 0; 

• if 6\ = • and ao = a i = lj 
o if £ = • and uq = a\ = 0; 

o if 6q = • and a_i = ao = 1; 
6q otherwise. 



Let L = 1Z = Z/2 x {°} j which we identify with Z/ 2 ; then $ acts on L := C 1 



11* 



R like the linear cellular automaton \1/ : (Z 



/2j 



(Z 



/2J 



with local 



rule ip(?c-i, %o, ki) '■= + Ka + (mod 2). Thus, £ = 7£ is a left- and right- 
permutative subalphabet for $ [see Example 3.1(c)]; if A = p is the uniform 
measure on L = R, then (L,R;A, p) is a resolving system, as in Example 
3.2. The set Z/ 2 x {•} is the set of defect states. An element of Dl r, has the 

form [...* 'I l \ do m n o T2 o r o 3 ...], where ( h r h d G Z /2 . The defect particle '•' 
moves left if l\ = — 1, and moves right if d = r\ = 0; otherwise it remains 
stationary. Figure 7(A) shows a close-up spacetime diagram of the resulting 
random walk, while Figure 7(B) shows a large-scale spacetime diagram of the 
same walk. (} 

If p G M(A N ), then we write, "For V M a G A N , [statement]", or "[statement], 
(/i-ae)", to mean "//{a G *4. N ; [statement] is truej = 1". If I C N, let prj : 
A N — >A l be the projection map. Thus, if p G M{A m ), then pr^p) G M(A l ). 
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Legend: 

I Defect particle path 



Cells determined by (Q * 
_&{t} 

A-random initial condition in L 
Incoming A-random 'noise' 



&* 
6[0..i] 



p-random initial condition in R H 
J Incoming p-random 'noise' 



Fig. 8. A schematic spacetime diagram illustrating the sigma algebras 
&{t} C 6[0...t] C 6* in Theorem 3.3. 

If J C N and b G v4. J , then let p|b be the conditional probability measure 
on *4 N given b; in other words, for any U C A n , p|b(U) := /i(U fl [b])//i[b], 
where [b] is the cylinder set defined by b. More generally, if U C A n is Borel- 
measurable, and if 6 is a u-subalgebra of the Borel a- algebra on A N , then let 
/i|s(U) be the conditional probability function of U given (5; i.e. jt/|g(U) is the 
©-measurable function such that, for any S G 6, / S jt/|@(U) dfi = p[S fl U]. 
This function is uniquely defined (p-ae); see e.g. [VarOl, §4.3]. 

In particular, if J C N, let (5 (J) be the sigma-algebra generated by all cylinder 
sets [c\j, where c <E A and j G J (hence & (N) is the Borel sigma-algebra of A N ). 
A cr-invariant p G A4(A N ) is Bernoulli iff, for any disjoint subsets I, J C N, and 
any b G A 1 , p|©(j)(b) = p(b) (p>ae). We say p is Markovian iff for any m G N 
and £ G *4, /J|e[o...m](Mm+i) = A«|e{m}(Mm+i)- Thus, p is a Markov measure 
if /x is Markovian and if, furthermore, for any a G A and for V^xG [a] m , we 

have yU| S {m}([£]m+l)(x) = r(a, 6). 

Proof idea for Theorem 3. 3: The left-hand measure A and right-hand measure 
p provide a continual influx of 'random noise'. The 'A-noise' propagates right- 
wards with unit speed because L is left-resolving for $, whereas the 'p-noise' 
propagates leftwards with unit speed because R is right-resolving for $. As 
shown in Figure 8, the defect particle's trajectory from time to time t is 
entirely determined by the information contained inside of a backwards 'light- 
cone' emanating from its position at time t back to the initial state at time 
zero. If the particle steps to the left [respectively, right] at time t, then it must 
step into the path of incoming A-noise [respectively, p-noise] which is outside 
of this lightcone, and hence, statistically independent of the particle's previ- 
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ous trajectory; see Figure 9(B) [respectively, Figure 9(C)]. If the particle stays 
put at time t, then it is exposed to both fresh A-noise and fresh p-noise; see 
Figure 9(A). In all three cases, the particle is subjected to fresh perturbations 
at time t + 1 which are statistically independent of its previous behaviour. 
Furthermore, A and p are ^-invariant, so the probability distribution of these 
perturbations is constant over time; hence they can be treated as a stationary 
Markov process, which drives the particle's motion. 

Proof of Theorem 3.3: Let A := A w , let $ : A z — >A Z be the IVth-power 
representation of $; and let L,R C A z be the W^th-power representations 
of L and R. Then L (resp. R) is still left-regular (resp. right-regular) and is 
still left-resolving (resp. right-resolving) for $. Thus, we can replace A with 
A, L with L, and R with R and proceed. By Remark 1.3(c), we can thus 
assume that W = 2 and that T : L 1 x A 2 x R x — >A 2 and V : L 1 x A 2 x 
Ri — 1, 0, 1}. A generic element of R has the form 

a = [. . . f 3 f 2 h ^o <t\ n rs r 3 . . .] , 

where ( n := a z _ n G Li and r n := a z+n+ 2 G Ri for all n£N, while ^ := a z+ i G 
A for i — 0, 1, with zeZ being the location of the defect. 

Let X := L 2 x A 2 x R 2 , and define £ : R — ^ so that, if a is as above, 
then £(a) := (fe, ft; 4), <fii a, r 2)- For any t G N, let £ t := £0$*. In other words, 
&(a) := tf; ^, 4\ r{, r|), where $'(a) = [. . . # ft* «tf <tf r* r\ r J . . .]. 
Next, define H : D£° R — # N by S(a) := (£ (a), 6(a), 6(a), .. .). Clearly, 
So$ = croH. Recall that 1/ is a function from Li x A 2 x Ri into { — 1, 0, 1}; 
treat this as a function V : A" — >{ — 1,0, 1}, and apply it coordinatewise to 
define V N : ^ N — ^{-1,0, 1} N . 

If Jl := £(//) G 7W(^ N ), and 1/ := V n (Jl) G -M({-1, 0, 1} N ), then tu = E(v). 
Hence, if pT is Markov, then z/ is hidden Markov, so that is a random walk, 
as desired. It remains to show that /Z is a Markov measure. 

Fix t G N. Let ©{£} be the sigma-algebra on A z generated by £t, and 
let £>[0...£] be the sigma-algebra on A z generated by (£0,6, • • • ,6)- F° r anv 
^ G A', let U* +1 := £t+i{^} be the set of all initial conditions in D^'r such 
that the defect particle at time t + 1 has internal state ^. To show that 
/I is a Markov measure, we must find some transition probability function 
r : X — >M(X) such that: 

For all 7t G X and t G N, P|e[o...t] 

For any 2 G Z, let D* := ja G D^'r ; ^(a) = zj, and let &* be the sigma- 
algebra on D* generated by cylinder sets in coordinates [z—t—2 . . . z+t+3]. 

2 

Then let 6* be the sigma-algebra on D L '. R generated by [Jzez&l- Clearly, 
<5[0..£] C (5*, because the information contained in &* is sufficient to de- 
termine the first t positions (z\, . . . , z t ) of the defect particle, and its first t 
internal states (£1, . . . , £ t ); see Figure 8. 



U 



t+i 



V\&{t} 



U 



t+i 



t&,k). (7) 
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Legend: 

Defect particle's path 



4-.J 



i £ 



t+1 




Cells determined by b 



6 




Incoming 'noise': cells influenced by left /right neighbours of b. 



Fig. 9. Claims 1.1 to 1.3 of Theorem 3.3. 



CLAIM 1: There exists a function r : X — >Ai(X) such that, for any ^ G X 
and t G N, we have M|e*[U* +1 ] = / U|e {t }[U* +1 ] = r(&,0- 
Proo/: Let * := (f 2 , d , ti, n, r 2 ), where (fe, ft.) G L 2 , (d , d\) G A 2 , and 
(n, r 2 ) G R 2 . Fix t G N. Let 6 = (tf , f/ ; ^, r *, rf) and & +1 = (f 2 ' +1 , f^ 1 ; 
rf" * +1 , rg +1 , where we regard these as twelve measurable functions 

on Dl'.° r . For v G {-1,0, 1}, let D v := {a G Dl' r ; 7(&(a)) = v}, and let 

mi T~v2,0 t~v-1 , nO i T-vl J .. v^l ..rr»vl ..v -5ir„ 



. Then Dl'r = D 1 U D° U D 1 , and fi = Ei=^i/i[D v ] • /i v - We 



will thus consider jj° and /i 1 separately. 

For v G {-1,0, 1} and z G Z, let := {a G D v ; ^(a) = 2}, and let 
B: := pr M _ 2 ..., H4H5] [D^] C Then let B v := [J B v 2 . 

Claim 1.0: For any b G B^, let [b] := {a G Dl' r ; a M „ 2 ...^ +3] = b}. 
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Then D v = J [b]. 

beB v 

U [b], for any z G Z. Thus, D v 
beB" 



Proof: B v z 
U W. 



U U [b] = 

zeZbeB" 

V Claim 1.0 



beB" 



Claim 1.1: For any * e X, /xj@*[U 

1 if 



K 



r t+l] 



To 



where 



) 



/" £ /■ £ . c £ c £ . £ £ 
*2 ' l l i a ' a l i 'It r 1 

h, k'i do, i\\ n, r2 



L^R 



(7 2 *, f/, <fg), 
i 



* { t } [U* 

d o = 4>{ti-> d 0> £ l), <*i = <K^0 ' ^1 ' r l)> 
n. = <j>(ti,4,r$), and r 2 G J r R(r 2 ); 

otherwise. 

Proof: Figure 9(A) shows how the values of , (£ * + , ^1 ; are deter- 



mined by the data in ©{£}, because fj* + 



rt+l 

"•1 — v 
and r 2 m 



|V , ^ , r x ), and r x 



f+i 



/l; r D r 2) 



C*, cCq), d t+1 
. However, /" 9 t+1 



Ji ) ^0 ' ^1 )j 
) ^2 j h ) 



r i5 r 2i r 3) n °t determined, even by &*. Instead, for any fixed 



z6Z and b G B°, there is a function $ b : PiX^_ 2 ) X-7"R.( 



^r(^ +1 ) such that (£ 



t+i t+i 

5 r 2 



z+t+3 J 



) = $b(^°-t-3) r i++4)- Furthermore, <3> b is bi- 



jective, because L is $-left-resolving and R is $-right-resolving. 

The set Pl(4V 2 ) x ^r( has cardinality PlPr, because L is left- 
regular and R is right-regular. Let yU| b be the conditional measure on 
D° given b. If 7? b := pr{ z _ t _ 3j2+t+4 }(/i| b ), then 77^ is the uniform measure 
assigning mass 1/(PlPr) to each element of V\X^-t~2) x ^R( r 2++3)' be- 
cause A is the Parry measure on L and p is the Parry measure on R. 
Note that any b G B° completely determines the values of fj* + and 



f+i 



r l - - (because these are (3*-measurable functions). Let ?7 b + be the uni- 
form measure assigning mass 1/(PlPr) to each element of PL^i* +1 (b)] x 
•7-R,[r* +1 (b)]. Then the /xj^-probability distribution of (£2 , r l +1 ) is the mea- 
sure $b(pr {2 _ t _3 i ^ +4 }(/ i jb)) = $ b(^b) vi +1 (here (*) is because $ b is bi- 
jective, while both 77^ and 7/ b +1 are uniform measures on sets with PlPr 

^ b M, where Tq is as defined above, and 



elements). Thus, //P b [U* +1 ] = r 
where we can treat £ t as a function of b (because £ t is ©^.-measurable). 
This holds for any b G B°, so Claim 1.0 implies that Atje»[U* +1 ] is the 

function D° 3 a 1— > r f^*^J £ [0, 1]. But this function is (5{t}-measurable 
(because £t is (5{t}-measurable by definition), so it is also /^js{rt[U* +1 ]. 

V Claim 1.1 



Claim 1.2: For any K e X, ^[U^ 1 ] = /^[U*" 

—2 if ( 2 € Pl(u), (i e <KL,t£,c£ 



rt+n 



T-i 



where 



r t /"t. ./"t yt. £ t 

2 ' 1 ' ' 1 ' 1 ' r 2 

£2 , £1 ; rfb , ^1 ; n , ^2 



(Pl) 5 



^ 2 *, C\, do), 



dl 



k, d i), d i), 
id 



n = 4>{do, d^ 4), and r 2 
otherwise. 



t t t\ 

1, T 1, T 2) 
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Proof: Figure 9(B) shows how the values of (<*o + \ <*i +1 ; ^ +1 ,r| +1 ) are de- 
termined by the data in &{t}, because ^ <+1 = if, <f *), ^i <+1 = 
0(4*, 4*, <f*), r* +1 = 0(4*, n*), and r 2 m = 0(4*, rf, r*). However, f* +1 = 
0(f 4 *, f 3 *, £j*) and 4* +1 = 0(^*, ^*, £,*) not determined, even by 6*. Instead, 
for any fixed z G Z and b G B^ _1 \ let 

£ b := {(U4, ^a) e ^ 2 ; e V^m) and 4-t-4 e Pl(4-^)}; 

Then #(£ b ) = (Pl) 2 ' because L is left-regular. Let /i| b be the conditional 
measure on D^ -1 ) given b, and let A b := pr^^ ^^dx^) G .M(£ b ); then 
A b is the uniform probability measure assigning 1/(Pl) 2 to each element 
of £ b , because A is the Parry measure on L. 
For any (f 2 , 4) G L 2 , let 

4) := 4') e ^ 2 ! <i e 0(L, 4, 4) and £ G P L (4')}- 

Then # [£(^> 4)] = (-Pl) 2 , because L is left-regular and $-left-resolving. 
Let A(4, 4) G M. [C(k, fi)] be the uniform probability measure assigning 
V(-Pl) 2 to each element of £(f 2 ,4). Note that any b G B^ -1 ^ completely 
determines the values of f 2 and 4* (because these are ©""-measurable func- 
tions). Let := £ (£*(b), 4*(b)) and A b +1 := 77 (#(b), /-/(b)). 

There is a function $ b : £ b — such that (4. m ,4 m ) = 
^b(4-i-^' ^-3)' and $b is bijective because L is $-left-resolving. Thus, 
the -conditional probability distribution of (r 2 +1 , 4* +1 ) is the measure 
$ b pr{a^4,a-*^}(^|b) = $b(A b ) = A t +1 . Here (*) is because $ b is bijective, 
while A b and A b +1 are both uniform measures on sets of (-Pl) 2 elements. 
Thus, /if b 1} [U* +1 ] = r_x (^ b) ), where r_i is as defined above, and where 
we can again treat £ t function of b. 

This holds for any b G B^ _1 \ so Claim 1.0 implies that [U* +1 ] 

is the function D^ 1 ) 3 a 1— > r_i (^'^) G [Oil]- But this function is 
(3{t}-measurable (because C, t is (5{t}-measurable), so it is also A*[e{t}[U* +1 ]- 

V Claim 1.2 

Claim 1.3: For any K eX, /if e *[U* +1 ] = ^ &{t} [U\ +1 } = n (f ), wiere 
Ti : — >vVf(A?) is defined similarly to r_i. 

Proof: Figure 9(C) shows how the values of (/" 2 + \ 4* + > ^i +1 ) are deter- 
mined by the data in &{t}, because 4* +1 = 0(£j* , 4*, <f *), 4* +1 = 0(4*, / *, <f*), 
<f m = <j>{4,4,r{), and = 0(^*, r*, r*). However, r x m = 0(rf, r 2 *, r|) 
and r 2 +1 = 0(r 2 , rj, r|) not determined, even by (5*. Now proceed as in 
Claim 1.2, but replace 4* +1 by r| +1 , 4* +1 by r{ +1 , L with R, A with p, 
and 'left-resolving' with 'right-resolving'. V ciaim 1.3 

Finally, define r : X — >M{X) by r{y, K ) := (*), where f(t,) G 
{ — 1,0, 1}, and where To and r±i are defined as in Claims 1.1 to 1.3. Then 
P|e4U* +1 ] = M|e W [U* +1 ] = r(&; 0- <> ^m 1 
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Claim 2: For any K G X and t G N, p\e[o...t] [U* +1 ] = p l& *[XJ{ +1 }. 
Proof: Claim 1 implies that u.\q* [U' +1 ] is actually a (5{t}-measurable function, 
because it is equal to /i|e{t}[U' +1 ]. But this means that /i| S *[U* +1 ] is also 
(5[0...t]-measurable, because &{t} C 6[0..i]. Also, for any C G (5[0...£], we 
have Jcj"|s*[ u * +1 ] dp = p[C n U* +1 ], because C G ©*, because 6[0...t] C 
(5*. But yU|e[o...t] [U' +1 ] is the unique (5[0...i] -measurable function with this 
property (by definition); hence /i| S *[U' +1 ] = P\<b\q ...t][U* +1 ]. O ciaim 2 

For any K G X and t G N, we conclude /i|e[o...t] [U* +1 ] = (/i| S *)[U* +1 ] = 
M|e{t}[U* +1 ] = t(^, where (*) is Claim 2 and (f) is Claim 1. Thus, 
eqn.(7) is satisfied. □ 

Remark 3.5: (a) Suppose the conditions of Theorem 3.3 are satisfied. Then 
the measure 5 can always be chosen so that the Markov measure ~p is shift- 
invariant (because every finite-state Markov chain has a stationary measure). 
The cr-ergodic components of /J are then the stochastic analogs of the particle 
types of Definition 2.2. 
The drift velocity of u is the expected value V dim (u) := ^[v] ■ v. If JL is 

vGV 

cr-ergodic (i.e. fx corresponds to a single particle type), then for z G Z N , 
lim (z n /n) = V dIi{t (uj) (by the Birkhoff Ergodic Theorem). Thus, V dlift (uj) is 
the long-term average velocity of particles of type ~p. For instance, the Markov 
chain in Example 3.4 has one ergodic component (i.e. one particle type), with 
V drm = 0. 

(b) A special case of Theorem 3.3 was previously proved in [Elo93b, 
Thm.2.1.1], for when L = C z and R = TZ Z are permutative subalphabets 
for $ [see Example 3.1(a)] and W = 0. In this case we must have 1Z ^ C 
for an (£ z , 7£ z )-defect to be meaningful. We recommend [Elo93a,Elo93b] for 
further interesting examples of diffusive defect dynamics, as well as analysis 
of their drift and variance. These methods were extended to defect ensem- 
bles in [Elo94], and to the pseudorandom motion of domain boundaries in 
two-dimensional boolean CA [Elo95]. 

(c) Empirically, the large a defect particle of ECA#54 [see Figure 3(a)] also 
performs a random walk, as can perhaps be seen in Figure 1(A). However, this 
motion is not due to the mechanism of Theorem 3.3, because a belongs to the 
'ballistic' regime of §2, not the 'diffusive' regime. Instead, the meandering 
is due to interactions with neighbouring a particles, mediated by a complex 
exchange of the tiny ^ particles of Figure 3(7 ± ). See [CH97, Fig. 13(b)]. ^> 

Corollary 3.6 Let $ : A z — >A Z be a CA and fix p,q G N. Suppose that 
either 

[i] L C Fix [$ p , a q ] and A G Ai(L), while R C A z is a right-resolving, 
right-regular Markov subshift with Parry measure p G M. (R) . 
or [ii] L C A z is a left-resolving, left-regular Markov subshift with Parry 
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measure A E M(L), while R C Fix [$ p , a q ] and p E M(R). 

Let W E N, let 5 be any probability measure on T> := A w , and let \i : = 
A® 5® p E M(P%$). Define ( : D^ R -^Z N by C(a) := (z , z p , z 2p , . . .), 
where, for all t E N, zt p E Z is as in eon. (5). TTien a; := £(/■*) £ A^(Z N ) is a 
random walk. 

Proof: (Case [i]) By using the gth higher power representation of A z [see 
Remark 1.3(c)] and replacing $ with $ p , we can assume that q — p = 1; i.e. 
that L C Fix[$,er]. Thus L = {f°°}f6£, where £ C ^4 is some subalphabet, 
and, for each I E C, the point C°° := [...[[[...] is $-fixed. Thus A = ^ c f l f , 

where, for each ( G £, 1 ( is the point-mass on and c f G [0, 1] is a constant. 
Thus, fj, = c f /i f , where /i f := l f (g) 5 <g) p. It suffices to prove that each 

of the measures /i ( induces a random walk. Hence, assume that A = If for 
some [ E C, and redefine £ := {/} and L := {C°°}- Then L is a left-regular 
and left-resolving subshift [see Example 3.1(e)]. Now apply Theorem 3.3. The 
proof of Case [ii] is analogous. □ 



4 The Turing Regime and Pushdown Regimes 

Recall that a Turing machine [HU79, §7.2] consists of a 'head' which deter- 
ministically moves back and forth along a 'tape', reading and writing symbols 
from some alphabet. To be precise, let T be a finite set. A (classical) Turing 
Machine with tape alphabet T is a quadruple (T>, r, T, V), where T> is a finite 
set (called the head state domain), r : TxV — >T is a tape rule, T : T xV — >T> 
is an update rule, and V : T x V — >{ — 1,0, 1} is a velocity rule. The machine 
statespace of the Turing machine is T z x T> x Z. If the machine is in state 
(t, (f , z) G T z x X> x Z, this means that the tape currently has symbol string 
t, the head is at position z on the tape, and the head has state description 
£ . If t := [. . . t z -i t z t z+ i . . .}, then define t' := [. . . t z -\ T(t z , d) t z+ i . . .}. The 
dynamics of the machine is the map G : T z xT> x Z — >T Z xDxZ defined: 

&(t,d,z) := (t', T(t z ,d), z + V(t z ,d)) . 

We will generalize this definition in two ways. First, we will imagine that the 
head lies between two tape symbols, rather than over a tape symbol. The head 
can read the two symbols to its left and two symbols to its right, and can 
overwrite the symbol immediately left or right. Second, we require that there 
are Markov subshifts L, R C A z such that the symbol sequence on the left 
half of the tape lies L~, while the right half lies in R + . The machine must 
write new symbols so as to respect the constraints of these subshifts. 



25 



Formally, an (L, H)-Turing machine is a sextuple (T>, r L , t , t r , Y, V), where V 
is a finite set, r L : A 2 x P — >A, t c : A x V x A — >A, t r : V x A 2 — >A, 
T : A 2 x V x A 2 — >V, and V : A x V x A — >{-l, 0, 1}. The statespace of 
the Turing machine is L~ xT>x R + x Z. If the machine is in state (1, d ,r,z) G 
L~ x T> x R + x Z, this means that the tape currently has symbol string 
(1 | r), where the head (indicated by '|') is at position z + \ on the tape, 
and the head has state description d. The machine dynamical system 6 : 
L^xDxR+xZ — kLt x£>xR+ xZ is defined by 6(1, <f, r, z) := (1', d', r',z'), 
where d' := T(f 2 , £l, d, n, n) and z' := 2 + V^fi, <£ , n.) and 



(1' | r') := 



4, 6, £ | 1, r 2 , r 3 , r 4 , 
) hi k>i hi L\ | r l, r 2, r 3) r 4, 
4, ^3; i I r 2' r 3) r 4) 



if Vik.d.n) = -1; 

if ^,^0=0; (* 

if K(fi,rf-,n) =+1. 



Here, f/ := r £ (fe, fi; rf") is such that (f 2 , f^) G L 2 ; 

r{ := r (<f ; n, r 2 ) is such that (r[, r 2 ) G R 2 ; 

and ^ := r c (^; ri) is such that ([{, t£) G L 2 , if V^fi, ^, a) = +1, 

whereas t-q := T a {C\] d\ n) is such that (t-q, r{) G R 2 if V(ii, n.) = _ 1> 

(If <£, n.) = 0, then the value of T c {k\ d; ri) is discarded, so it is irrelevant). 
Finally, 

• T(/" 2 , fi, d, ri, r 2 ) depends only on (/" 2 , fi, <f) if V(fi, <f, ri) = —1. 

• T(^ 2 , ki £ i n> r 2) depends only on (^, d , n) if V((i, d , ?i) = 0. 

• T(f 2 , (i, d , ri, r 2 ) depends only on (d , n, n) if V(fi, d, n) = +1. 

Proposition 4.1 Lei L, R C A z be Markov subshifts. Let W G N and Zet 
V := A w . Let L,RcV z be the Wth higher power representations of L and 
R. 



(a) Let $ : A z — >A Z be a CA with L,R C Fix[$]. Then the dynamical 
system (D^ R , $) is isomorphic to an (L, H)-Turing machine (T>, r, T, V). 

(b) Conversely, given any (L, H)-Turing machine (T>, r, T, V), there is a CA 
$ : A z — >A Z , with L,R C Fix[$], such that (Djf R , <&) is isomorphic to 

(d,t,t,v). 

Proof idea: The defect acts like the Turing machine head. The application of 
$ changes the head state, and can can also modify the adjacent symbols on 
the (L, R)-tape. However, just as in a Turing machine, the more distant tape 
symbols remain unchanged, because L and R are $-fixed. 

Proof: (a) By passing to the Wth higher power recoding, and replacing L 
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with L and R with R and A with A := A w , Remark 1.3(c) allows us to 
assume that V = A 2 and that T : A x P x ^4 — >£> and V : A x V x 
A — ►{—1,0, 1} in eqn.(6). To simplify notation, we will suppress the 'hats'. 
Define \& : R — kL~ xVx R + x Z so that, if a* is as in eqn.(5), then 
$(a*) := (1, d, r;z), where 



!:=[..., c*, r 2 4 , r/] G L 



r t t t 



gR 



d := [d*,d*] 6P = i 2 
and 2; := z t G Z. 



Let T and ^ be as in eqn.(6). If a m := $(a 4 ), then ^(a i+1 ) = (l',d',r'; z') 
where d' := [d^ 1 , d^ 1 ] = Y(7i,d, ri), and where 1' and r' are as in eqn.(8) 



with 



T c {k,d, n) 



$(r 2 ,fi,4); 

$(4,^ ,4) if ^(f lj d,r 1 ) = +l; 

$(4,4,^) if ^(f lj d,r l ) = -l; 

irrelevant if V(fi, d, ri) = 0. 



and r R (d, ri ,r 2 ) :=^(d\,r 1 ,r 2 ). 

(b) is a straightforward generalization of the method of Lindgren and Nor- 
dahl [LN90] for simulating a classical Turing machine with a cellular automa- 
ton. □ 



Proposition 4.1 applies even when L and R are cx-periodic subshifts, but in 
this case it isn't very interesting, because an (L, R)-admissible 'tape' can't 
encode any information, so the resulting Turing machine is rather trivial, and 
is described in §2. To perform useful computation, we need L and R to have 
nonzero entropy. If B C A z is a subshift, then the topological entropy of B is 
defined 

h(B,a) := lim lo ^ B ^>) , 

If B is a subshift of finite type, then h(B, a) > iff B is not a-periodic. In 
particular, if B is a Markov subshift defined by a digraph on the vertex set 
A, then /i(B, a) > iff this digraph is not just a disjoint union of cycles. 
Equivalently, there is a choice point vertex c G A, meaning that c belongs to 
at least two distinct cycles. See [LM95, Ch.4], [Kit98, §1.4] or [Kur03, §3.6.2]. 
A B-admissible sequence b can then encode nontrivial information, because 
for every z G Z with b z = c, there are at least two B-admissible possibilities 
for b z+ i, and a choice between these encodes at least one bit of information. 

A pushdown automaton [HU79, §5.2] is a finite automaton augmented with 
a 'stack' or 'last in, first out' ('LIFO') memory model. To be precise, a push- 
down automaton is a septuple (X, V, O, T; T, Q, S), where X, V, and O are a 
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finite input space, state domain, and output space, respectively (as in a finite 
automaton) , and T is a finite stack alphabet. Now T : X x T x T> — >T> is the up- 
date rule, n-.lxT xV — >(D is the output rule, and £ : IxT xD — >T U{0, A} 
is a stac/c riv/e. The machine statespace of the pushdown automaton isDx T N . 
The machine behaviour is defined by the map :IxPx T N — >X> x T N x (9 
defined Q(i,d,t) := t', o), where «£' = Y(i,^,t ), o := fi(i, <f,to), and 
where 



t' 



t 1; t 2 , t 3 , . . .) if E(t, d , to) = A (i.e. 'pop' the symbol to off the stack); 
t , ti, t2, • • •) if S(x, / , to) = (i.e. do not touch the stack); 
[t', to, *i, . . .) if £(/, ^, to) = t' (i.e. 'push' the symbol t' onto the stack). 



An autonomous finite automaton is a finite automaton with no input or output; 
i.e. a dynamical system T : T> — >T> where P is a finite set. Similarly, an 
autonomous pushdown automaton (APDA) is a pushdown automaton with no 
input or output; i.e. X = = O. Thus, the function fl is trivial, T : T x 
V — >V, and £ : T x V — >T U {0, A}. The APDA's future behaviour is 
entirely determined by the initial stack state. It is easy to see that an APDA 
is equivalent to an (L, R)-Turing machine where R = A z and L = {0} where 
is some 'null' symbol. Thus, we will treat it as such. 

Let M and M' be two machine-classes. We write M ^ M' if any machine in M 
can be simulated by one in M' (possibly not in real time). We say that M and 
M' are computationally equivalent (and write M rs M') if M ^ M' and M' ^ M. 
Let TM be the class of (classical) Turing machines, and let TMl r be the class 
of (L, R)-Turing machines. Let APDA be the class of autonomous pushdown 
automata, and let AFA be the class of autonomous finite automata. 

Proposition 4.2 Let L, R C A z be Markov subshifts. 

(a) // h(L, a) > and h(R, a) > 0, then 71v1 Li r w TM . 

(b) Ifh(L,a) > = h(R,a), orh{L,a) = < h(R,a), then TM LjR w APDA. 

(c) If h(L, a) = = h(R, a), then TM LjR w AFA. 

Proof: By a cycle of length P in R, we mean a word c — (c±, . . . , cp) G Rp, such 
that (cp, Ci) G R 2 ; hence the infinite sequence [. . . ccc . . .] is R-admissible. 

Claim 1: Suppose h(R, a) > 0. Then there is some PgN and some c G A 
such that c begins two different cycles c and Ci in R, both of length P. 
Proof: h(R, a) > 0, so there is some c G A which belongs to two different 
cycles in R; say b = (6?, b° Qo ) and b 1 = (&!,&£,..., 6jjJ, where 

Qo, Qi G N and i>5 = c = &^ . Let P := lcm(Q , Qi)- Let c (resp. Ci) be the 
cycle obtained by chaining together P/Qo copies of b (resp. P/Qi copies 
of bi). Then Co and Ci are distinct cycles of length P, both starting with c. 

O Claim 1 
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Claim 2: Suppose h(R, a) > and /i(L, a) > 0. Then there is some P G N 
and some r, C £ A such that r begins two different cycles r and r% in R, and 
[ begins two different cycles \q and li in L, with all four cycles having length 
P. 

Proof: Claim 1 yields two cycles c R , cf in R, say of length Pr, beginning 
with the same symbol, say r. The same argument also yields two cycles 
Cg, of in L, say of length Pl, beginning with the same symbol, say (. Let 
P := lcm(P R ,P L ). Let r (resp. r^) be obtained by chaining together P/Pr 
copies of c R (resp. cf). Let 1 (resp. lx) be obtained by chaining together 
P/Pl copies of Cq (resp. c{"). O ciaim 2 

Let Hf C R + be the set of all right-infinite sequences made by concate- 
nating copies of r and ri, and let T := {0, 1}. Define (3 R : T — >{r ,ri} by 
P B {t) := r t for t = 0, 1. Define bijection 0% : T N — >Rf by (3 R (t u t 2 , t 3 , ...): = 
[/3 R (ti) (3 R (t 2 ) P R (t 3 ) . . .}. Clearly, (3 R oo = o p o (3 R . In this way, we can en- 
code any binary sequence with an element of Hf. Likewise, let L^f C L 
be the set of all left-infinite sequences made from 1q and L, define j3 L : 
T — >{l ,h} by p L (t) := L for t = 0, 1, and define bijection 0* : T" N — >L^ 
by (3 R (. . . , t_ 3 , t_ 2 , f_i) := [. . . P L (t- 3 ) (3 L (t- 2 ) P L (t-i)]- Clearly, (3 L o a" 1 = 
o~ p o0 L . 

(a) "TMl,r di TM": Every (L,R)-Turing machine is clearly also an 
{A z , „4. Z )-Turing machine, which can clearly be simulated by a classical Tur- 
ing machine with tape alphabet A. 

"TM ^ TMl,r" : If M = (T>, r, T, V) is a classical Turing machine with tape 
alphabet T, then one tape symbol to lies directly 'underneath' the head. In 
contrast, in an (L, R)-Turing machine M* = ('D*,T Il ,T a ,T R ,T itt ,V*), we only 
have tape symbols to the left and right sides. However, if the head state 
domain T>% is large enough, then the head of M* can temporarily 'remember' 
the value of the symbol t , even though t is not written anywhere on the tape. 
So, let P, :=I>xTx Lp x Rp. If t :=(... , t_ 2 , t_i, to, ti, t 2 , . . .) and M has 
headstate d G V, then let 1 := . . , t_ 3 , t_ 2 , t-i), let r := [3 R (ti, t 2 , t 3 , . . .), 
and let M* have head state description d* := (d , to, 1*, r*) G V*. The third 
and fourth entries of d* are input buffers; their values (represented by 1* and 
r*) are currently irrelevant. 

Now, suppose M moves right by one step, overwriting t with t^ and chang- 
ing its head state to d' . Then M* moves right by P steps; during which time it 
reads [r 1; . . . , rp] and stores this P-tuple in the fourth entry of <4 (labelled 'r*' 
above), while writing the P symbols of /?z,(*o) to the tape. Finally M* com- 
putes t\ : = /5~ 1 [ri, . . . , rp] and changes its headstate to dl := (d', ti, 1*, r'J. 
(where 1* and are again irrelevant). 

Suppose M moves left by one step, overwriting t with t' Q and changing 
its head state to d' . Then M* moves left by P steps, during which time 
it reads [Cp, . . . , 4] (in reverse order) and stores this P-tuple in the third 
entry of d* (labelled '1*' above), while writing the P symbols of f3 R ( tg) to the 
tape. Finally M* computes t_i := /3~ 1 [/p, . . . , 4] and changes its headstate to 
dl := (d', t_i,l^,r*) (where 1^ and r* are again irrelevant). 
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Thus, the update rule Y* not only must emulate T, but also must implicitly 
compute and (5~ x . Also, the tape rules r L and r c not only must emulate 
r, but also must implicitly compute (3 L ; likewise, the tape rules r c and r R 
must implicitly compute f3 R . 

(b) Suppose h(L,a) = < h(R,a) (the case '7i(L, cr) > = h(R, cr)" is 
analogous) . 

"APDA r< TM LiR .": Let c ,ci be as in Claim 1, and let Rf C R+ be the 
set of all right-infinite sequences made by concatenating copies of c and ci. 
Now define a bijection : T N — >~R>f, and use to build an (L, R)-Turing 
machine which can emulate a given APDA with stack alphabet T, as in part 
(a). 

"TMl,r di APDA": L is a-periodic, so by passing to a higher power presen- 
tation, we can assume L contains only constant sequences. At this point, any 
(L, R)-Turing machine is clearly an APDA. 

(c) "TMl,r =< AFA": If h(L,a) = = h(R,a), then both L and R are 
periodic, so by passing to a higher power presentation, we can assume that 
both L and R contain only constant sequences. Thus, the only computation 
performed by an (L, R)-Turing machine is computation of the update rule 
T : T> — >T>; i.e. it is an autonomous finite automaton. 

"AFA 2< TMl,r": Conversely, if T : T> — >T> is an autonomous finite au- 
tomaton, then let M = (T>, r, Y*, V) be the (L, R)-Turing machine where 

Y : A x V x A — >A is defined by Y*(c,d,r) := T(d), and the functions 
V , t l , t c , and t r are not important (for simplicity, assume they are con- 
stants). Then clearly, the 'head dynamics' of (T>, r, Y*, V) is an emulation of 

Y : V — >T>. □ 



Remarks: (a) If h(L,a) > and h(R,a) > 0, then Propositions 4.1(b) and 
4.2(a) imply that some questions about the long-term behaviour of an (L, R)- 
defect particle are formally undecidable. For example, the question of whether 
the defect particle eventually stops moving is equivalent to the Halting Prob- 
lem. Sutner [Sut03] has identified similar undecidability issues for defect be- 
haviour. 

(b) The (P L , j3 R ) -encoding mechanism in Proposition 4.2 is quite crude; a 
much more efficient encoding could be obtained using finite state codes [LM95, 
Ch.5]. 

(c) In the standard definition, a Turing machine tape has only a finite seg- 
ment of nontrivial information; we do not assume this. Likewise, in a standard 
pushdown automaton, the stack has finite (but unbounded) height, whereas 
our definition allows an infinitely high stack. 

(d) Let M be an APDA. When moving to the right (i.e. into the R-segment), 
M acts like a finite automaton with state domain T>, reading an .4-valued 
input stream and producing no output. When moving to the left (i.e. into the 
constant 0-segment), M acts like an autonomous finite automaton with state 
domain V x A and update rule Y : V x A — >T> x A defined by T(d , r) : = 
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(T(0, cC , r), r(d , r)). A runaway cycle for M is an T-periodic orbit {{d p , r p )}p =1 
[i.e. f(d p ,r p ) = (cC p+1 ,T p+ i) and f(d P ,r P ) = (di,n)] such that V(d p ,r p ) = -1 
for all p £ [1...P]. In this case, M moves leftwards forever, and essentially 
belongs to the Ballistic regime of §2. Not every APDA has a runaway cycle. 
However, a variation of the Pumping Lemma shows that, if M moves leftward 
for long enough, it must enter a runaway cycle. Also, if M has a runaway cycle 
which is reachable from any initial conditions, and if p £ A4(R + ) is a Bernoulli 
measure with full support, then for V p r £ R + , an APDA with stack r will 
eventually enter a runaway state (see §3 for definitions of 'Bernoulli' and £ V P '). 

(e) By combining the arguments of Theorem 3.3 and Propositions 4.1(a) 
and 4.2(b), we can show that a defect with a $-fixed domain on one side and 
a $-resolving subshift on the other behaves like a pushdown automaton driven 
by a Markov process. This is the 'Markov Pushdown Automaton' regime in 
Table 2. 

(f) The obvious multidimensional analogy of Theorem 4.1 involves a multi- 
dimensional Turing machine [HU79, §7.5]. However, the problem of encoding 
a multidimensional bit array using a multidimensional subshift of finite type 
(analogous to the (/3 L , j3 R ) -encoding mechanism in Proposition 4.2) becomes 
much more complex. 

(g) A completely different mechanism for universal computation has been 
implemented using the (ballistic) defect dynamics of ECA#110; see [Coo04], 
[McI99b] or [Wol02, Chap. 11]. 

Conclusion 

We have described the propagation of defects under the action of cellular au- 
tomata, but many questions remain. For example, we assumed that the defects 
remain bounded in size, and act like 'particles', as is the case in well-known 
examples such as ECAs #54, #62, #110, and #184. In general, however, de- 
fects may grow over time like 'blights' which invade the whole lattice. What are 
necessary /sufficient conditions for the defect to remain bounded? (In general, 
this is probably formally undecidable; see [Sut03, Thm.3.2].) 

Our theory is limited to one-dimensional subshifts of finite type. This excludes 
some important cases (such as ECA #18), where the invariant subshift is sofic. 
Can our theory be extended to sofic shifts? (Eloranta's 'invariant subalphabet' 
approach covers some sofic shifts by passing to a higher power presentation; 
see [EN92,Elo93a,Elo93b]). 

Even when L and R are subshifts of finite type, we only understand defect 
dynamics in the polar opposite cases of 'extreme order' (i.e. L and/or R are 
^-periodic) and 'extreme chaos' (i.e. L and/or R are $-resolving, and endowed 
with Parry measures). We have been conspicuously silent about the so-called 



31 



'complicated' regime in Table 2. In this regime, pretty much anything can 
happen. To see this, let £ and 1Z be two disjoint finite alphabets, and let 
$£ : £ z — >C Z and $n : 1Z % — >7Z Z be any two cellular automata with local 
rules of radius 1. Let A := C U 7Z, and let $ : A z — >A Z be any radius- 
1 cellular automata such that *&\ cZ — an d ^\ n z = Let a := [1 r] 
where 1 6 £(-°°...o) an( j r ^ ^,[o...oo). then a a zero-width (L, R)-defect, 
where L := C z and R := 7?. z . This defect must persist over time and can 
move either left or right with unit speed. If $ has local rule <p '■ A^ 1 — >A, 
then the defect's next move is determined by the restriction of <fi to the set 
^[-i- 1 ] y^I- 1 - 1 ! u7?J~ L1 ]). However, the long-term behaviour of the defect also 
depends on the dynamics of the CA $c) and (7Z Z , which determine 
the 'input signals' which drive the defect. Thus, the defect's behaviour is 
potentially at least as complicated as the dynamics of any one- dimensional 
CA, which could be very complicated indeed. 

However, perhaps if we control the topological dynamics of (£ z , $c) and 
(JZ z ,^Ti), we can extend the classification of Table 2. For example, perhaps 
we could weaken the assumption of '^-periodic' to 'equicontinuous' in the Bal- 
listic and machine-emulating regimes, or perhaps we could replace 'right /left- 
resolving' with 'positively expansive' in the Diffusive regime. Also, if 
and (TZ' 1 , themselves manifest emergent defect dynamics, then perhaps 
we can analyze the behaviour of the (L, R)-defect through its interaction with 
these other defect particles (just as the Brownian motion of a macromolecule 
is driven by a continual bombardment of micromolecules) . 

Finally, can a comparable theory of defect particle kinematics be developed 
for subshifts of TL D for D > 1? Higher-dimensional shifts also admit infinitely 
extended defects shaped like 'curves' or 'surfaces' [Elo95,Piv07a,Piv07b]; what 
sort of motion do they exhibit? A general theory is probably hopeless: even 
interface curves in a two-dimensional boolean CA exhibit a bewildering variety 
and complexity of behaviour [GG98, §3-§6]. However, perhaps some special 
cases are tractable. 
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